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1^ We prove the strong consistency and the asymptotic normahty of the maximum hkehhood 

estimator of the parameters of a general conditionally heteroscedastic model with a-stable 
innovations. Then, we relax the assumptions and only suppose that the innovation process 
converges in distribution toward a stable process. Using a pseudo maximum likelihood 
estimator with a stable density, we also obtain the strong consistency and the asymptotic 
^-H normality of the estimator. This framework seems relevant for financial data exhibiting heavy 

tails. We apply this method to several financial index and compute stable Value-at-Risk. 
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1 Introduction 



ARCH models, introduced by Engle (1982) and generalized by Bollerslev (1986) are some of 
the most popular models for explaining financial time series. In these models, the time series is 
stationary but possesses a time varying conditional variance, this property can be used to explain 
some of the stylized facts that can be found in financial series. The GARCH modeling explains 
the volatility clustering but it also explains a fraction of the leptokurticity that can be found in 
financial time series. Empirical evidences can be found in the survey article by Shephard (1996). 
The most widely used estimator for the parameters of the GARCH model is the Gaussian Quasi 
Maximum Likelihood Estimator (QMLE). To implement this estimator, the Gaussian density is 
used to compute the likelihood of the model, even if the exact distribution of the error process 
remains unspecified. Under appropriate assumptions, the Gaussian QMLE is Consistent and 
Asymptotically Normal (CAN), see Berkes et al. (2003) or Francq and Zakoian (2004). 

Most of the assumptions required for the Gaussian QMLE to be CAN are mild, since one 
does not need to specify the true distribution of the error process, the model is less risky to be 
misspecified as in the Maximum Likelihood Estimation (MLE) case. The only assumption that 
can be challenged is the requirement that the error process possesses a finite fourth moment. 
The GARCH model and its derivatives are mostly applied to financial data which are known to 
be heavy tailed. Mandelbrot (1963) and Fama (1965) found that the unconditional distributions 
of most financial returns are heavy tailed and therefore do not necessarily possess a finite fourth 
moment. Now even if the GARCH modeling explains a part of the leptokurticity of the financial 
time series, the residuals are often found to remain heavy tailed. For this reason, there were 
several attempts to use GARCH models with non-Gaussian innovation, see Berkes and Horvath 
(2004) for a general approach. GARCH models with heavy tailed distributions have been studied, 
Bollerslev (1987) use the student t distribution and Liu and Brorsen (1995) used an a-stable 
distribution for the error process and studied the model empirically, see also Mittnik and Paolella 
(2003), Embrechts et al. (1997). 

In this paper we study a stable Maximum Likelihood Estimator (MLE) of a general con- 
ditionally heteroskedastic model in which the errors follow a stable distribution. To the best 
knowledge of the author, the CAN property of the MLE of such a model with stable innovation 
has not been proven, even in the GARCH case where the model was only studied empirically. 
Here we prove such a result under a few assumptions about the functional form of the volatility 
process. By specifying the distribution of the error process {r]t)t to be a-stable, we obtain a less 
general method than the Gaussian QMLE but we do not need any moment assumption and the 
model takes into account the fact the data can be heavy tailed. 

The Gaussian QMLE possesses the robustness property that even if the errors are not 
Gaussian, provided that their distribution is in the domain of attraction of the Gaussian law, 
the QML estimator is still CAN. We want to obtain a similar property for the stable GARCH 
model. Since the only probability distributions to possess a domain of attraction are the Gaussian 
distribution and the family of stable laws, we use this fact to obtain a robustness property for 
the stable estimation. In other words, we study the asymptotic behavior of the MLE written for 
stable innovations when the error process is not stable but close to a stable distribution. With the 
Generalized Central Limit Theorem (GCLT) (see Gnedenko et al. (1968)), we can characterize 
the domain of attraction of a stable law. A sum of i.i.d random variables with certain properties 
will converge in distribution to a stable variable. If the innovation process can be written as 
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a sum of variables, then if the sum converges, it converges in distribution toward a stable law. 
We use this property to give a more general result than the stable MLE. We prove that if the 
innovation process is not stable but converges in distribution to a stable variable, then the stable 
MLE (which in this case is a pseudo MLE) is still CAN. 

We will study a general class a conditionally heteroscedastic model, defined by 

= (1.1) 

where (ej)t is the observed process ( et G M), {r]t)t is a sequence of independent and identically 
(i.i.d) random variables (the error process), is a parameter belonging to a parameter space Q 
and g : x Q M^. This model contains most of the numerous derivatives of the GARCH 
model that have been introduced such as EG ARCH, TGARCH and many others, see Bollerslev 
(2008) for a exhaustive (at the time) list. Model (1.1) contains the classical GARCH(p, g) model 
given by 

g p 

af = u; + J2 "^^^ti + E ^i^?-r (1-2) 
i=i j=i 

The plan of this paper is as follows. We recall useful results concerning the stable distribution 
in Section 2. In the third section, we study a conditional heteroscedastic model with stable 
innovation and prove that the MLE is stable. In Section 4, we consider the case where the stable 
density is used to compute a pseudo MLE when the error process is not stable but converges in 
distribution toward a stable process. Then, we present in Section 5 some simulation results and 
some financial applications. 



2 Properties of stable distributions 

Since the pioneer work of Mandelbrot, the class of stable distributions is commonly used in 
finance and in other areas such as engineering, signal processing and many other. There are 
empirical evidences that some financial processes, denoted {Xt)t, have regularly varying (heavy) 
tails, that is, P [Xt > x] ~ Kx~°', when |x| — )• +oo, where is a constant and a £ (0, 2). Such 
a process has infinite variance, therefore the standard Central Limit Theorem (CLT) cannot be 
applied. Fortunately, the CLT can be generalized. An iid random process {Xt)t with regularly 
varying tails with index q < 2 is in the domain of attraction of a stable law, i.e. that there exist 
sequences (an)n and {bn)n such that 

— ^— — — - — bn Y, where y is a stable law with tail parameter a. 

an 

Only stable distributions possess a domain of attraction. Obviously, the Gaussian law is a stable 
distribution since the CLT states that every random variable with finite variance is in the domain 
of attraction of the Gaussian law. The Gaussian distribution is a particular case of a stable 
distribution with a = 2. 

The formal definition of a stable variable is quite simple: non degenerate iid random variables 
(Zt)t are stable if there exist a„ > and 6„ such that — 5^ = Zi. For a stable law, 

there exists, in general, no closed form of the probability density function. A stable variable 
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is characterized by four parameters, the previously mentioned tail exponent a, a parameter of 
asymmetry \(3\ < 1, a location parameter ^ G M and a scale parameter 7 > 0. When /3 = 0, the 
distribution is symmetric about /i. There are several special cases apart from the Gaussian case 
with a = 2, where the density is explicit. A stable distribution with a = 1 and /? = is a Cauchy 
distribution. When a = 1/2 and /3 = 1, we obtain a Levy distribution. 

Though the density of a stable variable cannot be written in closed form in the general case, we 
can write its characteristic function, in the (M) parametrization of Zolotarev (1986). The random 
variable X is called stable with parameter = (a, /3, 7) (we write X ~ S{ip) = S{a, /3, ^u, 7)) if 

exp|-|7t|" + 7"i/3tan('^]t(|t|"-i-l) +ifit\ if a /I, 
<p4t) = E[exj>itX] = { 2 1 ^^-^^ 

exp < — |7t| — ^if3t— log |7t| + ifit > if a = 1. 

For other parameterizations or more properties on stable distributions, see Zolotarev (1986) or 
Samorodnitsky and Taqqu (1994). The parameterization in (2.1) possesses the advantage of 
being continuous and differentiable with respect to all parameters, even for a = 1. Using the 
inverse Fourier transform, we can express the density f{.,ip) with the characteristic function 

fix, a, P, //, 7) = /(x, ^) = ^ I e-^*>^(i)cii. (2.2) 

2.TT Jm. 

From Bergstrom (1952), we give a series expansion of the stable density which will be useful to 
easily obtain properties of the stable distribution and to numerically compute the stable density. 



Proposition 2.1. For a < 1, we have 



X sm 



2\k/2 

L -r ; 

; + r 

, Q7r\ 
k [ arctan r H — — j + [ka + l)Trlx<-T 



TT— ■ ■ kl Ix + rl'^^+i 



(2.3) 



with T = /3tan For a > 1, the latter series does not converge but the partial sum of (2.3) 
provides an asymptotic expansion when \x\ tends to infinity, i.e. the remainder term has smaller 
order of magnitude (for large \x\) than the last term in the partial sum. In the case a > 1, we 
have another convergent series expansion given by 



fix, a, /?, 0, 1) = — ^ n cos 



k + 1 kir 

arctan r 

a 2 



(2.4) 



This proposition can be proven as in Bergstrom, the parameterization differs but the idea 
is the same. These series expansions will be used to numerically compute the stable density. 
Depending on the parameters a and /3 and on the value of x, the series (2.3) or (2.4) will 
efficiently approximate the density /. If these series do not provide a good estimation, we can 
use the Fast-Fourier Transform (FFT) or the Laguerre quadrature, see Nolan (1997) or Matsui 
and Takemura (2006). 
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In the last proposition, the parameters /x and 7 were fixed to and 1. To obtain a more 
general formula we can use the following relation 

/(x,a,/3,M,7) = -f(^^,a,(3,0,l) . (2.5) 
7 V 7 / 

From (2.2), (2.3) and (2.5), it follows that the density of a stable random variable is infinitely 
often differentiable with respect to x, a, f3 and fi. Prom the asymptotic expansion (2.3), we have 
the tail behavior of the density / and all its derivatives. When x — )■ ±00, 

/(x,^)~K|xr"-\ (2.6) 

f'{x,iP) = ^{x,4^)r.K\x\-^-\ (2.7) 

^{x,^l;)r.Klog{\x\)\x\-"-\ (2.8) 

||(x,^)~K|xr"-\ (2.9) 

|^(x,^)~K|x|-°-2, (2.10) 

where X is a generic constant, which is not necessarily the same depending on whether x — )■ +00 
or X — ^ — cxD. 

The idea of using stable laws comes from the fact that only a stable variable possesses a 
domain of attraction. The Gaussian distribution is a particular case of stable distribution (with 
a = 2), its domain of attraction contains all distributions with finite variance. The following is a 
CLT for heavy tailed regularly varying distributions in the particular case of a variable in the 
domain of normal attraction of a stable law. 

Theorem 2.1 (Gnedenko et al. (1968), Theorem 5, § 35). If the process {Xt)t is iid with 

P [Xt > x] ~ Kix-" when x +00, (2.11) 
P [Xt < x] ~ K2|x|"" when x -00, (2.12) 

with a £ (0,2), Ki > and K2 > 0, then, with 

^^KTTi^' «=|-«^(«)(^1 + ^2)C0S^| , 

r(i-a) 



M{a) 



when a < 1, 
a 

r(2 - a) 



a{a — 1) 



when a > 1, 



we have 



1 " 

Y.iXt-m)^Z, (2.13) 



where m = EXi if a > 1, m = if a < 1 and Z ~ S{a, /?, /? tan ^,1). 

The following theorem, due to Gnedenko et al. (1968) (Theorem 2, § 46), gives a uniform 
version of the previous result. 
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Theorem 2.2. Under the assumptions and with the notations of Theorem 2.1, if Xt has a 

density and if this density is of bounded variation, we have 

sup — /(x, — ^ 0, whenn^+oo, (2-14) 

where fn is the density of ^,Ji/a J27=i i-^t — fn) and f{-,ip) is the probability density of Z with 
%[) = (a,/3, /3tan l) as defined in Theorem 2.1. 

The previous theorem has been extended by Basu and Maejima (1980) as follows. 

Theorem 2.3. Under the assumptions and with the notations of Theorem 2.2, if the characteristic 
function w of Xi is such that 

/ du < oo, 

for some integer r > 1, then for < 6 < a, we have 

sup {l + \x\)^\fn{x) - f{x,i))\ ^0, whenn^+oo. (2.15) 

3 Conditionally heteroscedastic model with stable innovations 

In this section, we study the properties of the ML estimator, for the general class of con- 
ditionally heteroscedastic models defined in (1.1) with a stable error process. The probability 
distribution of {r]t)t is a stable law with parameter ip = {a,/3,iJ,, 1). For identifiability reasons, 
the parameter 7 has to be fixed to 1. 

Since we work with a general model, we make some general assumptions which can be made 
more precise for explicit models. We will, in particular, consider the GARCH(p, q) model. We 
suppose, 

AO {et)t is a causal, strictly stationary and ergodic solution of (1.1). 
Let 

) £n denote observations of the process (et)t. The true parameter of the model 
is denoted tq = (0q,'0o)'i where is in M™ and parameterizes the known function g, x/jq = 
(ao, /3o, Mo)' is the parameter of the stable density, the fixed parameter 70 = 1 being omitted. 
We still denote by f{.,'ijj) = /(., a, /3, /i) the density of a stable law and we also keep this 
notation for the stable characteristic function. The parameter tq belongs to a parameter space 
T = e X Ax B X C such that A C ]0, 2[, B C ]-l, 1[ and C C M. 
We define the criterion, for r = (9',ip'y £ T: 

inir) = -Y^ltir) with l,{r) = -loga^{e) -log f(Jl-,xl^), 

where the at are recursively defined using some initial values and 

5"t (6*) = g (et-i, . . . , ei, eo, e_i, ...-,6). 

We also define erf (^) = g {et-i, et-2, . . . ; ^). Let Tn be the MLE of model (1.1) defined by: 

r„ = argmin /„(r). (3.1) 
rer 

We define (0) = ^^(0), 4>t^,j{e) = ^£|t(^) and <|)t,^,j,k{^) = ^/(g) ae^lf^ae^ (^) 
and we state some assumptions on the function g and the parameter space T. 
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Al There exists a; > such that, almost surely, for any G 0, (Tt{0) > uj_. 

A2 For t > 1, sup \a^{9) - a'^{9) \ < Kp^, where is a constant and < p < 1. 

See 

A3 Vt, at{0) = (TtiOo) implies 9 = ^o, a.s. 

A4 The parameter space T is a compact set and tq S T. 

A5 There exists s > such that -Elejl* < +oo. 

A6 For any compact subset 0* in the interior of and for (i, j, k) G {1, . . . we have 
E sup \(t)t,i{0)\ < +CX), E sup \(t)t,i,j{0)\ < +0O, E sup \(l)t,i{e)(t>tj{e)\ < +00, 



E sup 

eee* 



^ sup \<pt,i{^)(l)t,j{0)(l}t.k 
eee* 



eee* 



6»e0* 



^Jt4{^)^t,j,ki0)\ < +00, sup \(l)t,i,j,k{0)\ < +00, 

eee* 



< +00. 



A7 For t > 1, sup 

See 



■{0) 



{9) < i^/,*, and sup ^(0) - ^(0) < i^p^ 



6*66 



Oct 

A8 The components of are linearly independent. 

We prove that the estimator r„ is CAN, the first result establishes the consistency, then with 
additional assumptions, we obtain the asymptotic normality of the estimator. 



Theorem 3.1. Under Assumptions A0-A5, the estimator Tn is consistent, 



To 



a.s. 



(3.2) 



//, in addition A6-A8 hold, 



with J = E 



drdr' 



E 



V^iTn - To) 4 (0, , (3.3) 

|^(To)i^(ro)], where kir) = 1 log ^^(0) - log / V^) . 



Remark 3.1. The numerous assumptions of this theorem are due to the fact that Model (1.1) 
is very general. For more specific formulations, some of theses assumptions vanish. For exemple 
in the case of the GARCH(p, q) model of Equation (1.2), Assumptions Al, A2, A3, A5, A6, 
A7 and A8 are obtained in Francq and Zakoian (2004). 

Concerning Assumption AO, in the case of the GARCH(p, g), we require the top Lyapunov 
exponent associated to the model (see for instance Berkes et al. (2003)) to be strictly negative. In 
the case p = q = 1, we draw the stationarity zones for the parameters a and b for different values 
of a. Here, we use a symmetric stable distribution (i.e. /? = 0). In Figure 1, we numerically 
obtained the strict stationarity zones which for each a, is the area under the curve. If a = 2, 
this is the stationarity zone for a GARCH(1,1) model with Gaussian innovation but in the case 
a < 2, the strict stationarity zone becomes smaller as a decreases. This can be explained by 
the fact that the smaller a, the thicker the tails. Then if the parameters a and b take too large 
values, the persistence of at is too strong and at explodes to infinity. 
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Figure 1: Strict stationarity zones for a GARCH(1,1) model with a-stable innovation. The curves 
correspond, in decreasing order to a = 2, a = 0.6. 

4 When the innovation process converges in distribution to a 
stable distribution 

In this section, for clarity purpose, we will enunciate the results for a GARCH(p, q) model 
(Model (1.2)). The same results could be obtained for a more general model but at the cost of 
some technical assumptions on the function g. We write a different version of Model (1.2) with 
an innovation process {j]nt) which now depends on n. We have, 

{^nt = <^ntVnt 
alt = (^o + Yl ^Oi^lt-i + bojCTnt-j, Vt e Z, Vn G N, ^^'^^ 
i=l j=l 

where the process {r]nt)t is iid with p.d.f. fn and converges in distribution toward a stable 
variable with parameter ■0o = (o^o, Po^ f^'o)' ■ This assumption will be made explicit below. As 
in Section 3, the parameter 70 is omitted and fixed to 1 for identifiability reasons. The true 
parameter of the model is tq = {9'q,iP'q)' , where Oq = {ojq, aoi, . . . , aoq, 601, ... , bop)' belongs to a 
parameter space G C (0, +00) x [0, +00)^'''''. The parameter tq belongs tor = 0x>lxi?xC, 
with A, B, C as in Section 3. Let the polynomials Ae{z) = Yl'i=i '^i^^ ^iid Be{z) = 1 — YTj=i ^j^'' 
where = (cj, ai, . . . , Og, 61, ... , hp)' . For 6 such that Z)j=i < 1 ^^'^ ^ i £ {1) • • • 
define the function Onti^) — k%T ~^ ^e^{^)^d{L)e^t, where L denotes the lag operator. 



We suppose that the process {r)nt)t is iid for every n G N, but we need a stronger assumption. 
Define for t E 7., = a {Un^fqJ-'nt), where Tnt = c {Vnu, u <t — 1} and suppose that for any 
t € Z and for any n G N, ijnt is independent of J^f 

We define a pseudo maximum likehhood estimator. The density of {r]nt)t is not specified but 
we suppose the convergence of this process to the stable distribution with p.d.f. /(., i/jq), where 
tpQ is an unknown parameter. We use this density to build a pseudo MLE. We have for r G F, 

1 " / \ 

Lir) = - V Intir) with [„t(r) = log ant{0) - log / ( , ,tp) , 

where the {d'nt{d))t are recursively defined using some initial values. We define 

Tn = argmin /„(r). (4.2) 

rer 

We have kept the same notations as in the previous section because all the involved quantities 
are defined in the same way and play the same role. The objects of this section simply display 
an additional n subscript. 

We define 7n as the top Lyapunov exponent associated to Model (4.1) and 7 as the top 
Lyapunov exponent associated to the model 

et = (Ttrjt 

at =uo + Y^ aoi4-i + ^0j<7t-j, G Z, ^^'^^ 
i=i j=i 

with rjt S{iIjo). The top Lyapunov exponent will be more precisely defined in Section 7. 

Bl To G r and F is a compact. 

B2 7 < and V0 G 6, Ej=i bj < 1. 

B3 There exists 6 > 1 such that for any n G N, E\r]nt\^ < +00 and sup (1 + |a;|)^|/n(a^) ~ 
/(x,Vo)| ^0. 

B4 If p > 0, Ag^^{z) and B0f^{z) have no common roots, ^6io(l) / and aoq + ^op / 0. 

B5 We have sup ae„(/i) < Kp^, where ae^{h) for /i G N is the strong mixing coefficient of the 

nGN 

process (e„t)t. 

o o 

B6 To GT, where T denotes the interior of T. 
Theorem 4.1. Under Assumptions B1-B5, the estimator Tn is consistent, 

Tn — > To, a.s. (4.4) 

n— >-+oo 

//, in addition B6 holds, 

V^(r„-To)4Ar(o,J-i), (4.5) 
= E [|^(ro)|^(To)l, where lt{T) is defined in the proofs. 



with J = E 
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Remark 4.1. The variance-covariance matrix in Theorem 4.1 is the same as in Theorem 3.1. 
There is asymptoticahy no cost for not specifying the true distribution of the innovation and 
instead assuming that the process converges in distribution to a stable law. 

Remark 4.2. The required assumptions for this result are very mild, Bl, B2, B4 and B6 are 
also needed for the classical Gaussian QML. Assumption B3 is specific to the problem and is 
verified in the case described hereafter where the innovation can be written as the sum of an iid 
process, 

r^n 1 = 1 

where for any t G Z, {i'it)i is iid and where {kn)n is an increasing sequence of integers and with 
Q G (1,2) such that there exist Ki and K2 such that 

P [vit > x] ~ i^ix^", when x — t- +00, 
P [I'lt < x] ~ K2X~°', when x — )■ —00, 

then if the density of satisfies the assumptions of Theorems 2.2 and 2.3, the Assumption B3 
is verified by Theorem 2.3. 

Remark 4.3. In the Gaussian QML case, the asymptotic inverse variance-covariance matrix J 
depends on the unobserved distribution of the process {r]t)t- Here the matrix J depends on the 
limit in distribution of the innovation process {r]nt)t- We can define an estimator for the matrix 
J, based on the process {ent)t and prove that this estimator is consistent. 

Remark 4.4. About Assumption B5, for each value of n, the fact that there exist constants K 
and p such that ae^{h) < Kp^ has been proved by Boussama (1998). We only assume that this 
is also true for sup a^^ih). 

neN 

Theorem 4.2. Define Jn = ^ Z]r=i g^g"'/ (''"»)• With the assumptions of Theorem 4-1, we have 

Jn — > J, a.s. (4-6) 



5 Numerical experiments 

In this section, we describe a simulation experiment which aims at studying the behavior of 
the pseudo MLE for finite samples, and for an innovation process whose distribution is close to a 
stable distribution. We use the algorithm of Chambers et al. (1976) to simulate stable processes 
and Proposition 2.1 to compute the stable density. 

We want to verify that even if the model is misspecified, that is if we use a stable MLE 
when the true distribution of the innovation process is not stable, the GARCH coefficients are 
still correctly estimated. We use a Student distribution with degree of freedom a (which by 
Theorem 2.1 is in the domain of attraction of a stable law of parameter a) to build an innovation 
process of the form 

k=l 
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(K) 

Using the results of Section 2, we obtain that, when K tends to infinity, r]l converges in 
distribution toward an alpha stable law. The problem is that, for identifiability reason, the 
parameter 7 of the stable distribution cannot be estimated and has to be fixed to 1. When 
K = +00, the process is alpha stable with parameter ip = {a, P, fi,^)' . The parameter 

ijj depends on the degree of freedom of the Student process {vk,t)k and can be calculated. For a 
generic case, we have 7 7^ 1. If we estimate a GARCH model with innovation process {jii^^^^ 
using a stable pseudo MLE method, we would obtain estimates of the parameters of the same 
model but written under a different identifiability assumption. For example, if we aim to estimate 
the model 

(+00) 

the stable pseudo MLE defined in the previous sections will converge toward 9q = {^,^, 60) > 
see Prancq et al. (2011) for more details on reparametrization of GARCH models. Note that 
the estimation of the "GARCH" parameter 60 is not affected by the identification problem. In 
order to compare estimates of the same quantity, it is thus important that the model is similarly 
identified for each value of K. Thereafter, we use the following identifiability condition. 

- If the innovation process {rjtjt is stable, then it is stable with parameter 70 = 1 (we recall 
that, if X is stable with parameter 7 = 70 > then ^ is stable with parameter 7 = 1). 

- It the innovation process {"^1)1 is not stable, we require that, among the family of stable 
distributions, the closest distribution to the distribution of {rit)t in the sense of the Kulback- 
Leibler distance is stable with parameter 70 = 1. 

Thus, for each we estimate the quantity jx, defined such that the innovation process defined 
by ''lt,K — • Sfc=i ^k,t satisfies the identifiability assumption. It is important to note that 
if we use another normalizing constant than jk, the results of the estimation by stable pseudo 
ML are as efficient as in the case where we use jk, the model is simply written under a different 
identifiability condition. 

We generated 1000 samples of size n = 1000 for different values of K [K = 500, K = 
1000, K = 10000, K = 100000 and K = 00) of the following model and estimate its parameters 
by stable MLE (or pseudo MLE). 



0.02e? . + 0.7(7.2 ^ 

(5.1) 



= 0.01 + 0.02ef_i + 0.7a^_i 



1 ^ 

We can summarize the simulation scheme with the following steps. For a parameter Oq, for K > 
and for a student distribution of degree a, 

- Step 1: we simulate 1000 samples of the variable (^ j^l/a J2k^=i then we fit a stable 

distribution on each sample. For each sample s, we denote by ips = {otg, Ps, l^'s^ls)' the 
results of this estimation. 

- Step 2: we compute jx = Es^T 7s- 

- Step 3: we draw 1000 samples of Model (5.1). 
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- Step 4: for each sample s, we estimate ri'^ = (ei'^', Vn(s)') by stable PMLE. 

The results of these estimations are presented in Table 1. For each of the six parameters 
(three for the GARCH dynamic and three for the stable distribution), wc give the quotient of the 
Root Mean Squared Errors (RMSE) of the corrected stable pscudo MLE and of the RMSE of the 

MLE, corresponding to the case K = 00. This statistic is given by Qf = j,^^^ where the 

RMSE for K can be obtained by, for i G {1, • • • , 6} , RMSEf = ^ Es=i° (t] - foi)^ The 
greater Qi is, the better the stable pseudo MLE is with respect to the MLE. In this simulation 
framework, we do not compare different methods of estimation. We use the same method but 
apphed to different data generating processes. Table 1 shows that, concerning parameters uj, a, b 
and a, the - ,4} increase with K. For large values of i^, the RMSE of the misspecified 

model is quite close to the RMSE of the asymptotic case, i.e. the well specified case. Table 1 also 
indicates that the result of the simulation for parameters /3 and S are more surprising. Indeed, 
the RMSE of the estimation of the asymmetry parameter /3 is smaller when K is small. But a 
good estimation of the parameter f3 is not of much information if the parameter a is not well 
estimated. Finally, for large values of K, we find that our estimator is not much affected by the 
specification error on the density used to compute the likelihood. 









a 


= 1.6 










a 


= 1.4 






K 


10 


50 


500 


1000 


10000 


100000 


10 


50 


500 


1000 


10000 


100000 


w 


0.34 


0.59 


0.79 


0.66 


0.89 


0.86 


0.56 


0.68 


0.80 


0.83 


0.78 


0.91 


a 


0.12 


0.24 


0.50 


0.54 


0.82 


0.83 


0.35 


0.61 


0.85 


0.74 


0.81 


0.97 


b 


0.42 


0.61 


0.76 


0.76 


0.94 


0.99 


0.76 


0.92 


0.97 


0.93 


0.94 


0.97 


a 


0.17 


0.31 


0.58 


0.70 


0.91 


1.03 


0.57 


0.84 


0.93 


0.94 


0.95 


0.96 




2.45 


1.45 


1.27 


1.22 


1.05 


1.01 


1.29 


0.92 


0.84 


1.12 


0.86 


1.04 


5 


1.26 


0.96 


0.96 


0.87 


0.89 


0.86 


1.07 


0.96 


0.90 


1.09 


0.90 


0.93 



Table 1: Ratio of RMSE for the parameters of the model for several values of K and a. 



6 Application to financial data 

In this section, we consider the daily returns of several indices and currency rates, namely 
the EURUSD, JPYUSD, DJA, DJI, DJT, DJU, CAC, FTSE, NIKKEI, DAX, S&P50 and the 
SMI. A GARCH(1,1) model with stable innovations is estimated on each of these series. The 
samples extend from January 1, 2008 to December 31, 2010. The estimated a's are lower for 
the period before 2008 so we only kept three years of data. Table 2 shows the results of these 
estimations with the standard deviation in parenthesis. We can see that even if the GARCH 
modeling explain a fraction of the leptokurticity of the series, the residuals still possess heavy 
tails since in most cases a is around 1.8 and thus different from 2 (except for the NIKKEI). 
When a = 2, the parameter /3 cannot be identified. 
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Index 


UJ X 


10^ 


a X 10^ 


b 


a 





M 


EURUSD 


0.133 


( 0.047) 


4.120 


( 0.720) 


0.877 


( 0.025) 


1.900 


( 0.020) 


-0.007 


( 0.480) 


-0.007 


( 0.075) 


JPYUSD 


0.578 


( 0.270) 


3.370 


( 0.770) 


0.667 


( 0.130) 


1.780 


( 0.037) 


-0.148 


( 0.190) 


-0.137 


( 0.062) 


INRUSD 


0.042 


( 0.016) 


2.300 


( 0.630) 


0.912 


( 0.024) 


1.820 


( 0.077) 


0.242 


( 0.230) 


-0.016 


( 0.067) 


DJA 


0.050 


( 0.044) 


4.610 


( 0.540) 


0.895 


( 0.014) 


1.820 


( 0.110) 


-0.039 


( 0.280) 


0.028 


( 0.071) 


DJI 


0.046 


( 0.038) 


4.670 


( 0.400) 


0.893 


( 0.011) 


1.840 


( 0.083) 


-0.501 


( 0.270) 


0.114 


( 0.071) 


DJT 


0.071 


( 0.070) 


3.820 


( 0.560) 


0.916 


( 0.014) 


1.880 


( 0.087) 


-0.060 


( 0.330) 


0.060 


( 0.067) 


DJU 


0.095 


( 0.045) 


4.690 


( 0.700) 


0.887 


( 0.018) 


1.910 


( 0.068) 


-0.296 


( 0.360) 


0.040 


( 0.064) 


CAC40 


0.221 


( 0.092) 


2.970 


( 0.550) 


0.914 


( 0.016) 


1.880 


( 0.072) 


-0.220 


( 0.210) 


-0.010 


( 0.058) 


FTSE 


0.156 


( 0.067) 


3.660 


( 0.610) 


0.898 


( 0.019) 


1.820 


( 0.050) 


-0.222 


( 0.230) 


0.059 


( 0.060) 


NIKKEI 


0.376 


( 0.140) 


5.700 


( 0.490) 


0.863 


( 0.021) 


2.000 


( 0.090) 


NA 


(NA) 


-0.021 


( 0.053) 


DAX 


0.136 


( 0.051) 


2.830 


( 0.510) 


0.922 


( 0.014) 


1.860 


( 0.079) 


-0.252 


( 0.270) 


0.034 


( 0.065) 


S&P500 


0.909 


( 0.250) 


3.870 


( 0.780) 


0.766 


( 0.055) 


1.770 


( 0.057) 


-0.162 


( 0.180) 


0.103 


( 0.062) 


SMI 


0.112 


( 0.049) 


4.850 


( 0.470) 


0.876 


( 0.015) 


1.850 


( 0.077) 


-0.255 


( 0.250) 


0.028 


( 0.064) 



Table 2: Estimation of a GARCH(1,1), standard errors in parenthesis. 

These estimations can be used to compute Value-at-Risk (VaR). If et is the return of the 
series, the VaR with coverage probabihty p at time t is defined as the quantity VaRt(p) satisfying 

¥t[et < VaRt(p)] = p, 

where is the probabihty measure conditionahy to the time t — 1 information set. Using a 
conditionally heteroscedastic model with stable innovation, if r„ = {9'^, is the MLE (or 
pseudo MLE if the innovation is not stable but assumed to be close to a stable distribution), we 
have 

\a.Rt{p) = at-i{0n)F^{p,i^n), 

where -F^(.,^) is the quantile function of a stable distribution of parameter tp (with 7 = 1). Wc 
compare this stable VaR to the VaR computed using a GARCH(1,1) model, estimated with the 
Gaussian QMLE. We compute a Gaussian QMLE on the indices used in Table 2 and obtain the 
Gaussian counterparts of the parameters in this tabic. 

Then we compute the Gaussian VaR and the stable VaR on an outsample data set (from 
January 1, 2011 to January 31, 2012). We give the results for the VaR of level 1% and 5% in 
Table 3. 
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Index 


Level 


=0.01 


Level 


=0.05 




Stable 


Gaussian 


Stable 


Gaussian 


EURUSD 


0.0108 


0.0180 


0.0755 


0.0791 


TPYUSD 

U J. -L \JVJ\-J 


0035 


0035 


0.0177 


0.0213 


INEUSD 


0.0106 


0.0142 


0.0426 


0355 


DJA 


0.0221 


0.0221 


0.0551 


0.0515 


DJI 


0.0147 


0.0221 


0.0551 


0.0588 


DJT 


0.0294 


0.0331 


0.0588 


0.0478 


DJU 


0.0110 


0.0110 


0.0551 


0.0588 


CAC40 


0.0108 


0.0179 


0.0645 


0.0717 


FTSE 


0.0037 


0.0110 


0.0625 


0.0662 


NIKKEI 


0.0114 


0.0114 


0.0342 


0.0342 


DAX 


0.0072 


0.0143 


0.0717 


0.0789 


S&P500 


0.0074 


0.0221 


0.0699 


0.0588 


SMI 


0.0109 


0.0181 


0.0688 


0.0688 


Mean 


0.0118 


0.0168 


0.0563 


0.0563 



Table 3: Frequency of hits for the Gaussian VaR and the stable VaR 

The two methods give very close results for p = 0.05, but the Gaussian method seems unable 
to explain the extremes of the distribution and the stable VaR seems to give better results for 
p = 0.01. In this case and for almost every index, the Gaussian VaR is underestimated. There 
are too many hits in the sample. We can conclude that the residuals of the GARCH model 
estimated by Gaussian QMLE are too leptokurtic to be explained by a Gaussian distribution. 
To conclude, the stable distribution seems to do a better job to explain the tails of the studied 
financial series. 
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7 Proofs 



Throughout the proofs and the paper, we denote by K and p generic constants whose values 
K > Q and < p < 1 can vary from line to line. 



7.1 Proof of the consistency in Theorem 3.1 

Let In (resp. It) be the equivalent of (resp. It) when an infinite past is known, 



1 1 / 

In{r) = -Y.^t{T) with lt{T) = -\ogal{e) -log f( 

^ t=l ^ ^ 



We first prove that the initial values are asymptotically negligible, that is 

= a.s. 



lim sup /„(r) - J„(r) 



(7.1) 



We have. 



sup 



/n(r)-/„(r) 



n 



t=\ 



sup 

rer 



llog^ + log^^^M^ 



where f]t(d) = ^ and r?t(^) = 

The function / is infinitely differentiable with respect to x, therefore, for r = (^', V'O' ^ 
we have 

|log/(77t(^),^) -log/(77t(^),V)| < sup ^'^"^'"^^ 



Next, using the asymptotic expansion (2.6)-(2.7), we obtain that 



tends to infinity and thus that x i-> ^p^''^^ is bounded on M. This can be obtained since for any 

■0 G ^ X i3 X C, the support of the function x i-)- /(x, ^) is M. Under Assumption A4, we obtain 

< oo. Using Assumption Al and A2, it can be seen that sup \'r]t{&) — Jytl^)! < 

rer 

K\tt\ and sup \dt{&) - ot{&)\ < K\et\p*. Thus we have sup | log /(7?t(6'), r) - log /(r?t(6'), r)| < 
rer rer 

K\et\p* and finally 

1 



tends to 0, when \x\ 



sup sup 
rer X 



sup 



/„(r) - /„(r) 



< 



n 



t=i 



In view of the Markov inequality, the Borcl-Cantclli Lemma, the existence of a moment of order 
s for the processus {€t)t (Assumption A5) and Assumption AO, we obtain that |€t|p* converges 
to almost surely when t tends to infinity. Then, using the Cesaro Lemma, we obtain (7.1). 
We now prove that Elt{T) > Elt{To), for r ^ tq. 

atiOo) fivtiO),ij)- 



Ekir) - Eltiro) = E 
< E 

= E 
= 0. 



- 1 



at{9) /(7?i,V^o) _ 

(^o)/(^^*'V'), 



E 



f^t[Oo) J V Me) 
at{e) /(77t,Vo) 
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To obtain the last equality, we used the fact that is in Qt = o- {r/„; u < t — 1}. 



Now, we show that, if Eltir) = Eltiro), then r = tq a.s. We have "^^^^g-j — 'ffrlt'^l) — 



Let ot-i 



The variable (r/t)t has a Lebesgue density, this yields 
Vx G M, at_i/(at_ia;,'0) = /(x, V'o) as. 



1 a.s. 



(7.2) 



We define X as a stable variable with parameters V'O) ^ ~ 'S'(V'o)) and Y = at-\X. Then, 
using (7.2) we show that the pdf of Y conditionally to Qt is f{x,ip), thus Y ~ S{ijj). Now, for 
G M, we write the characteristic function of Y and obtain 



E 



AuY 



E 



that is <^ij){u) = ip^f^{at-iu). Applying the modula to the previous equation yields 

Vit G M, exp{-|u|"} = exp{-|at_in|"°}. 

Therefore we have a = ao and we easily obtain P = Pq and ji = jiQ. We also obtain at-i = 1 
almost surely and we deduce with Assumption A3 that 6 = 9q a.s. 

Now, for r G r, let Vfc(r) be the open ball with center r and radius l/fc, using (7.1), it follows 
that 



liminf inf Ini'^*) = liminf inf Ini'^* 



The ergodic theorem yields liminf inf IniT*) ^ E 

-r*eVfc(r)nr 



inf lt{T*) 



(7.3) 



. When k — > +oo, then 



E 



inf lt{T*) 
-r*eVfc(r)nr 



tends toward E [/t(r)]. Therefore, we have 



Vr ^ TO, 3F(r), liminf inf I„(r*) > Mt(ro). 

T*eyfe(T)nr 

We conclude by a standard compactness argument, using Assumption A4 and obtain (3.2). 



7.2 Proof of the asymptotic normality in Theorem 3.1 
Lemma 7.1. Under the assumptions of Theorem 3.1, we have 

V^|^(To)4Ar(0,J). (7.4) 

Proof. For A G R'"+3, n > and t > 0, let Unt = ^A'|^(ro). We prove that {unt,Qt-i) is a 
martingale difference. We have, for r = {O'jI/j')' G T 



^(r) = -MO)Zt{T), 
dlt dlogf 



(7.5) 
(7.6) 
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withZ,(r) = l + ,,(^)f^. 

Since cr^ (^o) € Qt, (Tt{Oo) and rjt are independent and 



E 



dk 
80 



(ro) 



= E\Ueo)\E\Zt{aUi,)\. 



The function a; h-^ 1 + x is bounded, therefore we have |Zt(ro)| < +00. Moreover, with 

Assumption A6, we have E\(pt{OQ)\ < +00. With (2.8)-(2.10), we obtain Elvntl < +00. 
Now, we have 

'dh, 



E 



09 



{ro)\gt 



And, 



We infer E 



E [Zi(ro)] = 1+ / xf'{x, i;o)dx = 

JR 

§(ro)l = 0. Now, we prove E \^{To)\gt 



0. Note that 



E 



da 



iro)\gt 



Jr da 



The function x ^ /(z, -(/'o) is integrable, therefore 



da 



- / e 



^^(x,^o)rfx, and ^;^ = ^ 



da 



dk 
da 



(ro) 



We have, Va, (/5a,/3o,Mo,7o(0) = 0' thus, -^(0) = 0. Using the same method for |^ and |^ 
we obtain E [untlGt] = 0. 

We now prove that the covariance matrix of the vector of derivatives of It is finite, we have 



E 



dh , .dh , ^ 
^(ro)^(ro) 



-E 



Z^{to) E [</,t(0o)0;(^o)] , 



with E [Z'^{tq)\ =1 + 2 J^xf'{x,ipo)dx + ^ f{x'^o) asymptotic expansions 

in (2.6)-(2.10), we obtain E [Zf{To)] < +00, then by Assumption A6, it can be seen that 



V 



dk 



is finite. 



Using the asymptotic expansion again, we have when x — > ±00, foi tp E Ax B x C 

(x, V') ~ -ft^log |a;|, — — - — (xjil)) ^ K, — {x,^p)^Kx~^, (7.7) 



da 



dp 



d/j, 



therefore, for (z, j) G {1, • • • , 3}, we have E 



dh dk 



is finite. The very same reasoning applies 



for E 



dhdh 



and we deduce V 



dh 
dr 



< +00. 



We now show that this matrix is positive-definite. Suppose that (u', v')' G M"*+^ are such 
that {u',v').^{to) = 0. We have 



+ ^*7(^j 2^^*(^o) = . 



, dlogf 
dijj 



(7.8) 



17 



Now, it can be seen that ^1 + Vtyj^jf^ 



Thus, we have ^u'4>t{9o) = Q and 



G Qt+U v'^ivui^) G Qt+1 and ^n'MOo) G Qt- 



Q{l+Vt 



f{vt,i^) 



ON , dlogf dlogf 
vi— {r)t,a,P) + V2 {i]t,a,/3) + V3 



da 



with = {vi,V2,V3). We have when x — > +00, x^-^^^ 
letting X — )■ 00, we obtain = and Vx G M 



^ ~ K, then using Equation (7.7) and 



Q (/(x, V) + = V2^{X,^) + VS^{X,^). 

Now multiplying both sides of the previous equation by e**^ and integrating on M, we recognize 
the characteristic function of a stable distribution or its derivatives and obtain for t G M, 

dt dp ^""^ dfi ■ 

Then, for t > 0, it follows that 

at"" - i/3tan ^(at" ~ ) = ^^itan — - t) + v^it. 
Therefore, we have Q = and then ^2 = ^3 = 0. Finally, with Assumption A8 we obtain u = ^rn- 



We have, if {u\v') ^{tq) = then ( 



u , V 







m+3 



and the matrix E 



t(^o)|^(ro) 



dh 



IS 



positive. Finally, using the central limit theorem for martingale differences and the Wold-Cramer 



Lemma we obtain (7.4) with A = V 



dh 
dr 



□ 



Lemma 7.2. Under the assumptions of Theorem 3.1, for any compact subset V* in the interior 
ofV, we have 

dHt 



E 



sup 
rer* 



dnTjTk 



< +00. 



(7.9) 



Proof. For ease of notation, the next equations are written without their argument r = {9, V') G T. 
We have 



d%. 



1 5VI 



2 \afdei9j 



- (Pt,i^t,j \{1 + Vt{9) 



dlogf 



vt{9) 



dlogf 



dx 



Vt{9) 



dx 

a^iog/' 

dx'^ 



(7.10) 



d9id^ 
dHt 
d9id9jd9k 



1^ .m ^'l°g/ 



(7.11) 



i<f>t,i(t>t,j,k + (f>t,j(f>t,i,k + 4>t,k4>t,i,j) 



1 



/ 15 



a log/ 9 2 d'^logf 1 3 a^log/' 



5a; 



dx^ 4 « ^ ^ 



+ \h,i,j,k (^ + r]t{9) 



dlogf 
dx 
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We show that 



E sup 



< oo. 



(7.12) 



For any ij: e A x B x C, the function x i-)- x^^^{x,ijj) is bounded. Then, the function 
tp sup x ^^Q^-^ {x,ip) is continuous, thus since ^ x S x C is a compact set, we obtain (7.12). 

X 

Using the same method for r]1{9) ^ g°f ^ {rjt (0) , ^) and r]f (6) ^^"f (r/f {0),'tp) and using Assump- 
tion A6 we obtain that, for any compact subset T* in the interior of T 



E 



sup 



dOiOjOk 



(r) 



< +00. 



With the same reasoning and other calculations, we obtain Equation (7.9). 
Lemma 7.3. Under the assumptions of Theorem 3.1, we have when n — >■ +oo 

1 



sup 
rev 



n 



E 

^ \ drdr 



0, 



7^ 



dTT 



7^ 



□ 

(7.13) 
(7.14) 



Proof. We have 



§(ro)-§(ro) = 2^t^t-2^t^t 



U^t {Zt-Zt)+Zt{j>t-(t^t)}. 



(7.15) 



For -0 £ AxBxC, define the function h as h{x) = l-\-x^^^^{x, ip), we have Zf—Zf = h {fjt)—h{rjt). 
When X =boo, we have h'{x) = 0{x~^), therefore, with the mean value theorem, we have 



Z,-Zf 



< K\fjt - vt\ < K\et\ 



- cr. 



On the other side, concerning the second term in (7.15), we obtain 



< 



< K 



1 






H 

89 


1 1 




ae 


89 





89 



d_al 
80 



+ 



l_dal 
af 89 



Concerning the derivatives relative to the stable parameter it follows with the mean value 
theorem that 



Slog/ Slog/, . 



8a 



8a 



< sup 



8'^ log / 



8x8a 



(x,V') 



\vt - m 
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The derivative of log / with respect to a and x is bounded. Besides, we have 







<K\et\ 


~2 2 









Wt - Vtl 



We can apply the same method for the derivatives with respect to /? and /x. Therefore, using 
Assumption A7, the Markov inequality, the Borel-Cantelli lemma and the Cesaro Lemma, we 
easily obtain (7.13). 

Using (7.10), (7.11), the second part of Assumption A7 and the same techniques as before, 
we obtain (7.14). □ 

Proof of Theorem 3.1. From the definition of the ML estimator r„, we have j^{Tn) = 0, writing 
a Taylor expansion, we infer 

= — (r„) = ^(ro) + -q:^,{^ )(^" - -^o), 



dr 



dr 



where r* is between tq and r„,. Using another Taylor expansion. Lemma 7.2, the almost sure 
convergence of t„ to tq and the ergodic theorem, we obtain 



drdr 



drdr 



a.s. 



Then, in view of Equations (7.5), (7.6), (7.10) and (7.11), we obtain 



E 



drdr 



E 



dlt , .dk 
a7(-o)^(ro) 



By Lemmas 7.1 and 7.3 we can conclude and obtain (3.3). 



□ 



7.3 Proof of the consistency in Theorem 4.1 

Let In (resp. Int) be the equivalent of /„ (resp. Int) when an infinite past is known. 



1 1 
Inij) = - ^ /„t(r), with lnt{T) = - \o^al^{e) - log/ 

2 



e-nt 



We also need to define the equivalent of these quantities when the processus {r]nt)t is replaced by 
its limit in distribution {r]t)t- If {et)t is the stationary ergodic solution of Model (4.3), we define 
^tW = B^) + Se\L)AeiL)el and kir) = i bg ^^(e) - log / (^, V') . 

Assumption B3 can only be used for quantities which depend on a finite number of [rjntjt- 
Therefore, we introduce a^^\9), a truncated version of a'^^{6). For that, we give a vector 
representation of the GARCH(p, q) model as in Bougerol and Picard (1992), 



h.nt + ^ntZnt-l^ 
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where 







tnt = 



Wo 





and 



A, 



nt 



V 

1 








V 



nt 



'-nt 



^nt-q+1 



nt— p+l 






1 











^01 

1 








1 



bopVnt \ 









J 





We also define z^, and At, the counterparts of 6„j and Ant where rjnt is replaced by the 

iid sequence {r]t)t defined in (4.3). Note that 7„ is the top Lyapunov exponent associated to the 
sequence (^nt)tez- Now, we prove that Assumption B2 implies that 7„ is inferior to zero. 
With Lemma 7.4 below, we obtain for any n G N, 



+0O 



Kt + X] -^nt-^nt-l ■ ■ ■ ^nt-k+lhnt-k- 



k=l 



We define the truncated version of z^f For any m G N, 



Kt + X] ^ntAnt-l ■ ■ ■ Ant-k+lhnt-k, ^ Z, Vn G N. 



(7.16) 



(7.17) 



k=l 



In particular, if X{k) is the k^^ element of the vector X, we define a truncated version of a^t, 
that is 



zi^\q + 1) = hntiq + 1) + AntAnt-1 ■ ■ • Ant-k+lknt-kj {Q + 1) 

The quantity z^\q + 1) depends only on {rjnt-i, • • ■ , Vnt-m}- 

2(m) 



(7.18) 



Then we define a^^t (d) for any 9 e O. For this purpose, we introduce another vector 



representation of the model. 



altie) = Cnti9) + Balt-,i9), 



where 



( <yim \ 
^^t-l(^) 

V (^^t-p+i(^) } 







B 



(hi 62 • • • bp\ 
1 ••• 



V ... 



/ 
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By Assumption B2, we have sup p{B) < 1, where p{B) is the spectral radius of the matrix B, 

9e& 

and thus, for any 6 G Q, 

+00 

alie) = Y.B'c^t-kiO). (7.19) 

k=0 

We define for m G N, 

m q 

^T\o) = E ^'(1' ^)&{m), with ci7V)(i) = + E "^iTic? + ^)^lt~^■ (7-20) 

As for z;^\q + 1), the quantity a\j;^\6) depends on a finite number of rjnt', but since every 
^^-iil + 1) depends on several rjnt': cr'^t^\(^) depends on more than m variables rjnt'- To 
be precise, a^[™'^(6') depends on {rjnt-i, ■ ■ ■ ,r]nt~2m+q}- Define also 1^^\t) = \ \og(T^^\9) - 



Lemma 7.4. Under the assumptions of Theorem 4-1, there exists G N such that 

Vn > A^, 7„ < 0. (7.21) 
Besides, there exist /cq G N and A^ € N such that 

X'=SUP E[\\Ank,Ank^_i---Anlf]<l. (7.22) 

n>N 



Proof of Lemma 7.4- With Assumption B2, using the norm = ^ which is a multiplica- 
tive norm and with Lemma 2.3 in Francq and Zakoian (2010), we have the existence of /cq G N 
and of s > such that 

X = E[\\Ak,Ak,.i---Aif] < 1. 

Now for n G N, writing Ant = A{rint) to emphasize the fact that Ant only depends on r]nt, we 
have for s > 

E [\\AnkaAnko-l ' ' ' ^InlH = / WM^l) ' ' ' ^(^^fco)!!'' fn{xi) ■ ■ ■ fn{xko)(lxi ■ ■ ■ dxkg- 

For e > 0, by Assumption B3, we have the existence of A^ G N, such that 



Vn > A, Vx G M, /„(x) < G(x), where G{x) = f{x) + , i^,,^. 



e 

Then, the function A is such that Vx G M, < ||A(j;)|| < Kx"^ and therefore 



feo 

P(xi) • • • A(xk,)f fn[xi) ■ ■ ■ fnixkj < K ^ \Xi\^'GiXi), 

i=l 

Using the asymptotic expansion (2.6) and choosing < s < min we infer 

ko ko 



f Kfl\xi\'^'G{xi)fldxi <+oo, 
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Thus, since fn simply converges to /{.jipo), using the dominated convergence theorem, we obtain 



hm E [\\AnkoAnko-i ■ ■ ■ Ainf] = E [\\AkoAko-i ■ ■ ■ AlW'] = x < 1- 

n— ^+00 

Therefore, for e > 0, there exists N £N such that, for n > N, we have 

E[\\AnkoAnko-l---^nlf] < I - S, 

and thus, x' = sup E [||^„fco^nA:o-i " " " ^nill*] < 1 and we obtain (7.22). 

n>N 

Then, using Lemma 2.3 from Francq and Zakoian (2010), we obtain (7.21). 
Lemma 7.5. Under the assumptions of Theorem 4-1, there exists s > such that, 

sup E\ent\'^'^ < +00, and sup Ea'^^ < +00. 

neN nSN 



□ 



(7.23) 



Proof. For n> N, using the inequality (x + y)'^ < x^ + y^ for x,y > and s < 1, Equation (7.16), 
the fact that the norm is multiplicative, the independence of the processus {r]nt)t and Lemma 7.4, 
we obtain 

{+00 fco "I 
i + Ex''E{^p«irr ■ (7.24) 
fe=0 i=l J 

Now, we prove that there exists s > such that sup £'|r/„f < +c>o. In view of Assumption B3, 

jieN 

we obtain that E\r]nt\'^^ converges toward E\r]t\'^'^ < +00, for s < 6/2. We used the dominated 
convergence theorem again and also the fact that for any n G N, we have E\r]nt\'^^ < +00 for a 
small enough s > 0. Therefore, with (7.24), it follows that 

sup E \\Znt\\^ < +00. 

jieN 

Now, for any n G N and any t £ Z, we have cr^^ < and e^^ < Consequently, we 

obtain (7.23). □ 

Lemma 7.6. Under the assumptions of Theorem 4-1, there exists s > such that, 



Proof. We first prove that there exists s > such that 



supS c7-(0o)-^lT^^(9 + l) <Kp^. 

neN 



(7.25) 



(7.26) 



For m > kQ, let [m/feoj be the floor function of m/kQ {k^ being defined as in Lemma 7.4), we 
have 



S[rn/ko\) 



E ll^nt ■ ■ ■ ^nt-fc+lll ll^nt-fcl 
fc=[m/fcoJ+l 
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The constant s can be taken such that s < 1 and, using the inequality {x + y)* < + for 
x,y > and the independence of the processus {rjntjt^ we infer 



sup E 

n>N 



< J2 sup E[\\Anf-Ant-k+l\\"]sup E[\\b^t_k\\'] 
+00 ko ( ^ » 

< sup E[\\b^,r] Yl x'E sup E\\An,r\ 

"^^ k=[m/ko}+l i=l u^^ J 

defining iV G N and using similar arguments as in the proof of Lemma 7.5. With exactly the 
same arguments, we obtain for any n G N the existence of Kn > and pn < ^ such that 



E 



Thus, there exist K > and p < 1 such that 



(Lm/fcoJ) 



< KnP^- 



sup £^ 

neN 



_ ^(LmAoJ) 



Then, we use the inequality cr^f - ^^^^^(^ + 1) < o-nt - + 1) 

obtain Equation (7.26). We also obtain 

supE[a2,-zi7)(g+l)1 <i^p™. 



< 



(m) 



and 



(7.27) 



We now prove the inequality (7.25). We remark that for any m G N and for any ^ G 0, we 
have o-nt^\0) < Cnt(^)- Then, we have 



sup sup E \ aim - 1 < E sup ^""(1' l)'«up E 



nSN 6»ee 



fc=m+l 



nSN 



sup C^t-fc(^)(l) 

6>ee 



(7.28) 



+ sup S'=(l, 1)^ gsup afsup ErilU_kSup E \{ult-i-k - ^T-i-fel^ + 1))' 



fc=0 



In view of the second part of Assumption B2, we have sup p{B) < 1. Then, using Lemma 7.5, 

9ee 

we obtain 

+00 

Y sup B''{l,iysup E 



k=m+l 



neN 



sup C^t_fe(^)(l) 

See 



Now for the second part of (7.28), Equation (7.27) and the fact that p{B) < 1 yield 



Y sup B^{1, 1)' ^sup afsup Er]ll_i_ksup E 



a^,_,_,-4S_,(g + l) 1 <i^p™. 



Finally, having treated the two terms of the right hand of (7.28), we obtain (7.25). 



□ 
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Lemma 7.7. Under the assumptions of Theorem 4-i, we have for any c? G N and for any subset 
VCT 



E 



E 



E 



inf k{T] 



inf Intir) 
rev 



inf Intir) 



< +00, 



E 



n— >+oo 



E 



inf k{T] 
rev 



inf k{T) 
rev 



(7.29) 
(7.30) 
(7.31) 



Proof. We prove (7.30) in the case d = 1. The other cases and (7.31) can be obtained with 
similar arguments. We will prove the following intermediate results. For any subset V dV 



(i) sup E 

neN 



inf UT)-mf ff(r) 



(ii) E 



inf /,(r)-inf 4™)(r) 



(iii) For any m > 0, E'inf 1^^\t) E'lrd 4™^(t), when n +00. 

We have for any 9 e O, a'^ti^) > oj. Since is a compact set, there exists a; > such that, 
G e, Vt G Z, Vn G N, alt{e) > u and V0 G 6, Vt G Z, Vn G N, (J^^'V) > ^- P^om 
Lemma 7.6 and using the mean value theorem, it follows that 

sup sup E \ogal,{e) - \ogaT\e) < Ksup sup E al^id) - al^^\9) < Kp^. (7.32) 
neN See neN 6»ee 



For G G, let ant{0) = and let a;;;^(e) 



+ 



. We have for s' > 



2{m)s' 



2(m)s' / 



Setting s' = s/2 and using the Cauchy-Schwarz inequality and the results of Lemmas 7.5 and 
7.6, we obtain 

sup sup E aitie) - ai7)^>) < Kp^. 



neN 0ee 



Then, using the independence between <t^j(^) (or cr^j™^) and rjnt and Assumption B3, we obtain 
for any 9 e & 

sup sup E \\r,nt\'' ai^{e) - a^T^^V) 1 < Kp^. 

neN eee 

Defining the function F^{x) = log /(a;^/^', V'), we have, if rjnt > 

log f{ant{e)Vnt,^) - log/ (ai7V)^nt, V') | = {aimr)ntf) - (ai7)^>)|r?„tr' 



The derivative of F is such that = x^^^' ^ '^J^iL' 't-! ■ We have, when x — >■ +00, -g-r 



1/x. 



OF 

Therefore if we take s' < 1 we obtain that -jS is bounded. Then since F is a compact set and 



dx 
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since ip i-> sup -q^{x) is continuous, we obtain sup sup -q^{x) < +00. In view of the mean 
value theorem, it can be seen that 



sup 

rer 

and finally 



log / {ant{e)Vnt, V') - log / (aiTV)^nt, V') < ^l^nt^'sup a^iO) - aiT^^V) 



sup sup E log / {ant{0)Vnt, V') - log / (a^^\0)r]nt, ilA < Kp" 



Using Equations (7.32) and (7.33), we obtain 



sup sup /,,(r)-47)(r) <Kp^ 



(7.33) 



(7.34) 



Now for m G N, for Ki > and for |pi| < 1, for any n G N, there exists fjn,n € T such that 



Int {fm,n) - mf lnt{T) < K-^pf and there exists f^,„ e T such that (^m.n) - mf /^T^C'^) < 
Kipf. Now if /nt (-?m,n) < Int {rm,n), we have 



inf l„t(r) - inf 



inf /„t(r) - Int {Tm,n] 

rev 



+ 



lnt{Tm,n) - mf 4TV) 



inf /„t(T) - Z„t ifm,n) 



Or if (TVn,n) < 4?^ (Tm,n), we have 



inf /„,(r)-inf C^(r) 



< 



inf lnt{r) - Int {rm,n) 



+ 



lnt{fm,n) - inf 4TV) 
rev 



(m) / 



< Kipf + 

Now, if Int {Tm,n) > Int {Tm,n) and (fVn,n) > ("^m.n), we have 



< ifp'' 



inf Intir) - inf 

rev rev 



< 



inf ^nt(T) - Int {rm,n) 
rev 



+ 



+ 



''nt Vm,n) ini l^j {T) 
re V 



We have 



(7.35) 



^nt ('^m,n) 4t ^ ('^m,n) ^ ^nt ("^771,71) 4t ^ {'^m,n) ^ ^nt (''"m,n) 4t ^ (''"m,n) j 

and thus, with (7.34) we obtain E Int ('^m,n) ~ 4t ^ ij'm,n 

) < -ftTp'". Finally, with Equation (7.35) 
we obtain (i), the step (ii) can be obtained in the exact same way. 

[step (iii)] We have, for m G N* and r e T, ^^^(r) = \ \oga'^^^\e) - log / (ai7V)^nt, V') • 
The quantity o\^^\0) depends on a finite number of Tint- More precisely o-'^^\9) is a function 
of {r^nt-k, G {1, . . . , 2m + q}}. Now, from (7.19) we obtain that the expression of CF^t^\o) 
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contains only products of powers of rinf- Therefore, since is a compact set, there exist M > 
and in,..., r2m+q) e N2"^+« such that 



(7.36) 



Using the same arguments, it follows that 



Then, with the asymptotic expansion (2.6), we have Vx G M, f(x,ip) > Kx ° ^ and VV' G 
A X B X C, Vx G M, /(x, ifj) < K. Therefore, there exist (sq, . . . , S2m+g) such that 



Vr G r, if max(M, 7/^)^° . . . max(M, r?4-2m-,)'''"+^ < /(«nt(^)%t, V-) < K. (7.37) 

In view of (7.36) and (7.37), we obtain the existence of M > and > 0, z G {0, . . . , 2m + 
such that 



inf ff(r) 



1+ J2 |^^ilog{max(M,r/2,_.)}| j 



Then, we can apply the dominated convergence theorem as we did before and obtain (7.29) and 
(iii). 



Now, to obtain (7.30), we use (i), (ii) and (iii) and obtain 



lim E 

n— >+oo 



inf Intir) 



= lim lim E 

n— >-+oo m— >-+oo 

= lim lim E 

m—^+oo n— >+cxD 



inf ff(r) 
inf 4T^(r) 

re V 



lim E 

m— >+oo 



inf /^V) 
rev ^ ^ ' 



E 



inf lt{T) 



The limits inversion can be done since the convergence in m is uniform with respect to n. □ 
Lemma 7.8. Under the assumptions of Theorem 4-1, for any subset V CT, we have 



-5^inf Z„t(r)^£; 

m * ^ 1-1= T/ 



inf kir) 



a.s. when n — > +oo. 



(7.38) 



Proof Let Xnt = (inf lnt{r)r ^nd let M„ = ^J2t=i^nt- We also define Sn = M„2 and 

rUn = EXjit- We have ESn = rrin^ and Var^n = ^ 5^/^=0^ C!ov (X„t, X„t_h). We now prove that 
there exists M > such that for any n G N we have 



Gov (x„t, x„t_/i) 

h=0 



< M. 
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i7^ = {inf &\r)r-'^ ^('"^ 
With the step (i) of the proof of Lemma 7.7, we have for any m G N 



As in the proof of the previous Lemma, we define X!^' = ( inf 1]t^'{t))~^ and R!^' = Xnt — 



sup E 



R 



(m) 
nt 



KKp-^ and sup ER^^^^ < Kp"". 

neN 



(7.39) 



Now, for /i e N 

Cov(X„t,X„i_;,) = Cov(4pJ\x„i_;,) +Cov(4LV'J\x„t_;,) . 
Using (7.39), we obtain 

Cov(i^iL^/^J^x„,_,)|<i^p^ 



sup 

neN 



(7.40) 



It can be seen that, for any r G F, lnt{T) can be written as a measurable function of {ent')t'<t- 
Therefore, Xnt-h is also a measurable function of {ent')t'<t-h- Besides, X^^'*^^-'^ is a measurable 
function of (£«*')*'>*- [/i/2j • Thus, we have for /i G N 

CoY{xi!t™,Xnt-h)\<aeA[h/2\). 

Note that is the mixing coefficient of the process (e„f). Therefore, Assumption B5 yields 



sup 

nGN 



.h)\<Kp' 



(7.41) 



Now with (7.40) and (7.41) we obtain sup \Coy {Xnt, Xnt-h)\ < Kp and consequently 

En>i VarSn < +00. 

Then, using the Tchebychev's Inequality, we obtain 



1 



F [\Sn - m„2| > £] < ^ ^ Var5„. 



(7.42) 



n>l 



n>l 



Thus, since the series of Equation (7.42) is convergent we obtain the almost-sure convergence of 
Sn — 171^2 to 0. We also have from Lemma 7.7 the almost sure convergence of m„ to E{ inf (r))+, 

therefore 



Sn^E{mf /t(r))+, a.s. 



We now prove that M„ converges also to E 



(inf /i(r))- 



almost surely. Let g„ = l^/n\ be 



the floor function of ^/n. Since the element of the sum M„ are positives, we have 

^qlSg„<Mn<^{qn+l?Sg^+l. 

2 

Using the fact that ^ converges to 1, we obtain the M„ — > £'(inf Itir))^, a.s. Finally, using 
the same method for the negative part, we can conclude and obtain (7.38). □ 

Lemma 7.9. Under the assumptions of Theorem 4-1, we have 



lim sup In{T) — Inij) < 0, a.s. 

n-*-+oo ^gp 



(7.43) 



28 



Proof. Let a^^ be the vector obtained by replacing <T^j_j by ^^t-i ^^'^ &nt be the vector 
obtained by replacing e^Q, . . . , e^i_ by some initial values. In view of (7.19), we have 



k=l 



Assumption B2 yields sup p{B) < 1, consequently, we have 

See 



sup 
6»ee 



2 ~ 2 



< Kp^ (max(e^o> • • • > 4i-g) + max(a^o> • • ■ > ^ni-p) + l) 



Then, since the random variables max(e^Q, . . . , e^i_q) and max((T^Q, . . . , possess moments 

of order s, by Lemma 7.5, we can conclude as we did in the proof of Theorem 3.1. □ 

Then, the proof can be done in the exact same way as in the proof of Theorem 3.1. Starting 
from Equation (7.3), we can use Lemma 7.8 to conclude. 

7.4 Proof of the asymptotic normality in Theorem 4.1 

We introduce a truncated version of the derivatives of a^^. Prom (7.19), we obtain 



doj 



k=0 

+ CXD 



nt 



dai 



k=0 
•' k=l 



.1=1 



(1), ViG{l,...,p}. 



For m G N, we define 
/ da: 



2 \ (™-) 



du 



{e) = Y.B\i,i), 



da: 



2 \ (™-) 
nt 



k=0 

m 



{9) = J2 B\l, l)4TVi(^ + ^Wnt-k-^, VZ G {1, . . . , g} , 



k=0 



da: 



2 \ V™) 
nt 



fc=l U=l 



Y.B'-'B^^)B'-'c^l,{e) 



(1), VjG{l,...,p} 



(7.44) 
(7.45) 
(7.46) 



where B'^'-'^ is a p x p matrix with 1 in position (1, j) and zeros elsewhere. Then, we define 
and for i G {1, . . . ,p + g + 1}, 05](^) = ("^f^ {B). 



nt 



(0), 
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Lemma 7.10. Under the assumptions of Theorem 4-1, there exists a neighborhood V{6o) of 60 
such that, for any 9 G V{9q) and for {i,j, k) E {1, . . . ,p + q + 1}, we have 



sup E cl>^t,imnt,M - </>iM(^)'^S](^) < ^P'' 

neN 

sup E cPnt,mnt,m<Pnt,k{(^) " ^i;] (^?)</'S(e)0S(^) < K 
n6N 



(7.47) 

(7.48) 

(7.49) 



And 

sup sup E\(j)nt,i{0)\ < +00, sup sup E\(/)nt,i{0)(j)nt,j{9)\ < +00, (7.50) 
eev{eo) neN eev{eo) "gn 

sup sup E\(pnt,i{0)(pnt,j{0)'Pnt,k{^)\< +00. (7.51) 
eey(6lo) nGN 

Proof. In this proof, for clarity purpose, the arguments (9) are omitted {(pnt stands for (t>nt{G))- 
We have for n G N, t G Z, G 6 and i G {1, . . . , g}. 



1 d(^lt 



1 



(7, 



(m) 



< 



1 



(T; 



nt 



nt 



dai 



(m) 



+ 



'a 2 \ 



nt 



doi 



1 



1 



(T, 



nt a, 



2(m) 



nt 



(7.52) 



We begin by the first term of the previous equation, we have 



dalt /^^2\M +0O 
dai 



nt 



doi 



k=m+l 



Then, we remark that cije^^_j.. _i < c„t_fc(l) and that a^^ > u + B'^{1, l)c„t_jt(l) and we infer 



(7, 



nt 



dai 



d(^lt 
dai 



(m) 





+00 


< 


E 




c=m+l 




+0O 


< 


E 




c=m+l 



1 i?^Xl,l)Cnt-fc(l) 



1 rB^(i,i)c„,_fe(i) 



using the inequahty x/{l + x) < x^ for all x > 0. With By B6, we have aoi > 0, thus there exists 

a neighborhood V{9q) of 9n such that inf ai > 0. Then, using Lemma 7.5 and the fact that 

eevido) 

the spectral radius of B is inferior to 1, it follows that 



sup E 

n€N 



1 



a: 



nt 



dai 



dai 



(m) 



(7.53) 



Turning to the second term of Equation (7.52), the mean value theorem applied to the function 
x I-)- x~^/^ yields 



cr; 



nt 



2(m) 
^nt 



< K- 



l/s+l 



(^nt - (^nt 



2(m)s 
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where a is between cr^f and cr^^"*^*. Since d > cr^^'"'**, it follows that 



(7, 



nt (7, 



2(m) 
nt 



< K 



a, 



1 

2(m)+2s 



t2s _ ^2(m)s 



(7.54) 



nt 



Then, we have 



'nt 



9ai 



2(m) 
nt 1 



k=0 



k=0 



and thus, (^) < Now, with (7.54) and using Lemma 7.6, we obtain 



sup E 

neN 



da; 



2 \ (™-) 



nt 

dai 



1 



cr, 



nt (7, 



1 

2(m) 
nt 



< Ksup E 

neN 



1 



a, 



(m) 



nt 



2(m)s 
f^nt 



2s 2(m)s 
'^nt ~ '^nt 



Finally, in view of the previous equation and (7.53), we obtain 



sup E 

neN 



alt dai 



nt 



4"^) 1, da. ) 



If we adapt the proof for the derivatives with respect to bj and u, we obtain (7.47). Now for the 
first part of (7.50), for any n € N with already used arguments, we have 

1 dal,^^^ 4^<K 



E 



alt da, 
1 d^lt 



alt duj 



'nt 



dbi 



'^J k=l 



OJ 



And the first part of (7.50) comes easily. 

Turning to (7.48), we have for (z, j) G {1, . . . ,p + q + 1}^ 



'^nt,i(Pnt,j 



Am) Am)' 
nt,i ^nt,j 



< 



'MtA 



nt,j 



^im)' 
^nt,j 



JntA 



0nt,i| 



Am)' 
nt,j 



Am) 
^nt,j 



By (7.47) and the first part of (7.50), we obtain sup sup 

eev{eo) neN 

other results of the lemma can be obtained with similar arguments. 



< +00 and (7.48). All the 

□ 



Lemma 7.11. Defining 

'dlnt\^^^ 



(l + r?nt^(r?nt,Vo)) 



(ro) = 



dr ) ' \ -^{Vnt,^o) 
and under the assumptions of Theorem 4-1, uje have 



sup E 

neN 



dl 



nt 



dr 



(to) 



5Lt\(™) 



'nt 



(to) 



KKp"" 



(7.55) 
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Proof. From (7.5), we have 



dl 



nt 



89 



(ro)- 



dl 



nt 



89 



(ro) 



(m) 



^ (^1 + Vnt ^^Q^"^ {rint, V-o)) (^nt - ^/'i?^) • 



Now, since ^1 + rjnt^^^^^i'nnt, V'o)) only depends on rjnt, we can apply the dominated convergence 
theorem with Assumption B3 and obtain 



lim E 

n— >H-oo 



^ + Vnt—Q^{Vnt,V0) 



E 



dlogf 



< +00. 



The last inequality has been proved in Section 7.2. Since the function x i-^ (l + x^^^^{x,tpo)^ 
is bounded, it is now clear that 



sup E 

Then, with Lemma 7.10 we obtain 



Slog/ 

^ + Vnt—g^{Vnt,W0) 



< +00. 



sup E 

neN 



81 



nt 



89 



(ro) - (^(ro) 



(m) 



Finally, since 



dtp v^o 



(m) 



l^(ro) we obtain (7.55). 



Lemma 7.12. Under the assumptions of Theorem 4-1, we have 



lim e'^1^(tq) = E^^(tq) and lim E 

n->+oo 8t 8t n->+oo 



f^^nt / x dint / X 

97(^0)^ (to) 



'8t' 



□ 



(7.56) 



Proof. To obtain the result, we first prove that, for any m G N, E(f)^\9o) — )• E(f)[^\9o) when 

n — )• +00. From (7.44), we know that {9o) does not depend on n. Since cr^j > > 0, 

we can apply the dominated convergence theorem and obtain 



E(pi^}{dQ) ^ E4f{9o), when n ^ +00. 
Now, using the same method as in the proof of Lemma 7.10, we infer for i € {1, ■ 

1 / ^'(1,1)4^^(1) ' 



^nt,l+i 

k=0 "^0'* 



where s can be chosen as small as wanted. The same arguments as in the proof of Lemma 7.7 
and the previous equation imply that (p^^i^i is a function of {r]nt-k', 1 < A; < 2m + q} and is such 

that, for any s > 0, there exist K,M > such that (l)nt\+i ^ ^ Ili=^'^ max (M, rj^t-i) ■ Then, in 
view of the dominated convergence theorem, we obtain 



E(j)["^}^-{9o), when n ^ +00. 
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Doing exactly the same for (as in Lemma 7.10) with j G {1, . . . ,p}, we obtain 



E&{0o) E4"'\eo), when n^+oo. 
Since 1 + rjnt ^^Q^^ {Vnt, V'o) only depends on r]nt, we easily obtain 



E 



Slog/ 

1 + ??nt^ [VnUVO) 



Consequently, 



lim E 



dl 



nt 



de 



dx 

(m) 



n— >+oo 



W = «(|) (ft). 



0. 



Now, with the asymptotic expansions (2.8)-(2.10) and with previously used arguments, we also 
obtain the convergence for the derivatives with respect to ip. Finally, inverting the double limit 



Hm lim £; f 

n m \ or 



(m) 



lim lim E 

m n 



(m) 



(^^o)) we obtain the first part of (7.56). It is clear 

□ 



that the second part of (7.56) can be obtained with very similar arguments 
Lemma 7.13. Under the assumptions of Theorem 4-1, we have 

3T r 



(7.57) 



with J = E 



l^(-o)|^(ro) 



{Ynt - EYr 



nt 



Proof. For A G W+i+\ we define Ynt = A'^ff (tq), Yt = A'f^(ro) and z^t = ^ 
We will apply the central limit theorem of Lindeberg to the array (znt) to prove this lemma. We 
obviously have Eznt = and E [z^^] = ^ (^E [Y^^] — E [Ynt]^^- Now, in view of Lemma 7.12, we 
have, when n tends to infinity 



E 



Y 



nt 



E 



Y/ 



< +00, and E [Ynt]^ ^ E [Yt]^ = 0, 



using the results of the proof of Lemma 7.1. The Lindeberg condition remains to be proven. We 
have for any £ > 0, 

n "'If 

and P [|y„t — EYnt\ > £\/n] — > 0, when n — > +oo. Besides, we have sup E \Ynt — EYnt^ < +oo. 
We can conclude and obtain the Lindeberg condition 



n 

[^nt\nt\>e 0> when n +oo. 



It remains to apply the Lindeberg central limit theorem and the Wold-Cramer theorem and we 
obtain (7.57). □ 
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Lemma 7.14. Under the assumptions of Theorem uie have 

Q^In{ro)^-J, a.s. (7.58) 

Proof. Adapting (7.10) and (7.11), we obtain for € {1, . . . ,p + q + 1} 

dXt 1 / 1 d^'jlt ^ . \f, , dlogf 



nt " 

21 



1 / 51og/ 9^ log/ 

:(Pnt,i(Pnt,j'nnt ^ h r?„t 



.2 



dXt 1, a^log/ 



Using (7.46) and (7.47) in Francq and Zakoian (2010) and the same reasoning as in Lemma 7.7 
(defining a truncated version of d'^a'^^/ drdr'), we obtain for (i, j) G {1, . . . ,p + q + 1}^ 



E 



nt 



dndTj 



(ro) 



E 



n— ^+oo 



dndTj 



(to) 



Then, as in Lemma 7.8, we obtain the result. 



□ 



Lemma 7.15. Under the assumptions of Theorem 4-1, there exists a neighborhood V{tq) of tq 
such that for k) £ {1, . . . ,p + q + 1}^ 



1 



^T. sup 



nt 



<r)^E 



sup 



ey(To) dndTjdTk 



a.s. 



ra^rey(ro) dndTjdTk 
Proof. Using the results of Lemma 7.10 and of Lemma 7.2 the proof is straightforward. 
Lemma 7.16. Under the assumptions of Theorem 4-1, we have when n — )■ +oo 



1 



E 



dl 



nt 



t=l 



dT 



dl 



nt 



sup 



n 



E 

t=i 



dH 



nt 



dTdT 



dT 
d%,t 



(to) 



0, 



d 



TT 



(7.59) 

□ 

(7.60) 
(7.61) 



Proof. This lemma can be easily proven using the same arguments as in Lemma 7.3 and 
Lemma 7.9. □ 

Proof of Theorem 4-1- Using Lemmas 7.10-7.16 and the same method as in the proof of the 
asymptotic normality in the ML case, we obtain the result. □ 



34 



7.5 Proof of Theorem 4.2 

For G {l,...,p + g + 4}^, a Taylor expansion yields 



dridTj 



dTjdr. 



(ro) + 



dr'dTidTi 



(r)(r„-ro). 



Using Lemma 7.2, Lemma 7.15, the equivalent of Lemma 7.12 for the second order derivatives 
and the consistency of the estimator r„, we obtain 



n ^ OTiOTj n-^+co 



dTidTj 



(to) 
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